Search for: All records
-
Total Resources: 5
-
Chevalier, Alexis; Geng, Jiayi; Wettig, Alexander; Chen, Howard; Mizera, Sebastian; Annala, Toni; Aragon, Max_Jameson; Rodriguez_Fanlo, Arturo; Frieder, Simon; Machado, Simon; et al. (International Conference on Machine Learning)
-
Deshpande, Ameet; Jimenez, Carlos; Chen, Howard; Murahari, Vishvak; Graf, Victoria; Rajpurohit, Tanmay; Kalyan, Ashwin; Chen, Danqi; Narasimhan, Karthik (Computational linguistics Association for Computational Linguistics)
-
Yao, Shunyu; Chen, Howard; Yang, John; Narasimhan, Karthik (Advances in neural information processing systems)
Most existing benchmarks for grounding language in interactive environments either lack realistic linguistic elements, or prove difficult to scale up due to substantial human involvement in the collection of data or feedback signals. We develop WebShop – a simulated e-commerce website environment with 1.18 million real-world products and 12,087 crowd-sourced text instructions. In this environment, an agent needs to navigate multiple types of webpages and issue diverse actions to find, customize, and purchase a product given an instruction. WebShop provides several challenges including understanding compositional instructions, query (re-)formulation, dealing with noisy text in webpages, and performing strategic exploration. We collect over 1,600 human trajectories to first validate the benchmark, then train and evaluate a diverse range of agents using reinforcement learning, imitation learning, and pre-trained image and language models. Our best model achieves a task success rate of 29%, which significantly outperforms rule heuristics but is far lower than expert human performance (59%). We also analyze agent and human trajectories and ablate various model components to provide insights for developing future agents with stronger language understanding and decision making abilities. Finally, we show our agent trained on WebShop exhibits non-trivial sim-to-real transfer when evaluated on amazon.com and ebay.com, indicating the potential value of our benchmark for developing practical web agents that can operate in the wild.
-
Chen, Howard; Suhr, Alane; Misra, Dipendra; Snavely, Noah; Artzi, Yoav (2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))